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Amendments to the Specification 

Please replace the paragraph beginning at page 43, line 17 with the following amended 
paragraph: 

Library screening, probe preparation and PCR: Tatl clones were obtained by screening a 
Landsberg erecta (La-0) 1 phage library (Voytas et al. (1990) Genetics 126: 713-721), using a 
probe derived by PCR amplification of La-0 DNA. The primers for probe amplification were 
based on the three published Tatl sequences (DV0158, 5 '-GGGATCCGCAATTAGAATCT-3 \ 
SEP ID NQ:42 ; DV0159, 5 ' -CGAATTCGGTCC ACTTCGGA-3 ' . SEP ID NG:43 ). Peleman et 
al. (1991) Proc. Natl. Acad. Sci. USA 88:3618-3622. Subsequent probes were restriction 
fragments of cloned Tatl elements, and all probes were radiolabeled by random priming 
(Promega). Long PCR was performed using the Expand Long Template PCR System 
(Boehringer Mannheim) with LTR-specific primers (DVP354, 5'- 
CC AC AAG ATTCTAATTGCGG ATTC-3 \ SEP ID NG:44 ; DVG355, 5'- 
CCGAAATGGACCGAACCCGACATC-3 \ SEP ID NP:45 ). The protocol used was for PCR 
amplification of DNA up to 15 kb. The following PCR primers were used to confirm the 
structure of Tat 1-3: DVC405 (5 ' -TTTCC AGGCTCTTGACGAGATTTG-3 ' . SEP ID NP:46 ) 
for the 3' non-coding region, DVP385 (5 '-CGACTCGAGCTCCATAGCGATG-3 ' J3EQJD 
NG:47 ) for the second PRF of Tat 1-3 (note that the seventh base was changed from an A to a G 
to make an Xhol and a Sail restriction site) and DVG371 (5'- 
CGGATTGGGCCGAAATGGACCGAA-3 \ SEP ID NP:48) for the 3' LTR. 

Please replace the paragraph beginning at page 45, line 28 with the following amended 
paragraph: 

If the Tatl sequences in pDW42 and pDW99 defined retrotransposon insertions, a PBS 
would be predicted to lie adjacent to the 5' Tatl elements in both clones. The putative Tatl PBS 
shares similarity with PBSs of Zeon-1 and another maize retrotransposon called Cinful (see 
below), but it is not complementary to an initiator methionine tRNA as is the case for most plant 



Applicant : David A. Wright et al. Attorney Docket No.: 0841 1-030002 / ISURF 02410-D1 

Serial No. : 09/965,553 

Filed : September 27, 2001 

Page : 3 of 15 

BEST AVAILABLE COPY 

retrotransposons. Additionally, a possible polypurine tract (PPT), the primer for second strand 
cDNA synthesis, was observed one base upstream of the 3 5 Tatl sequence in both phage clones 
(5 ' -GAGGACTTGGGGGGC AAA-3 ' , SEP ID NG:49 ). We concluded from the available 
evidence that Tatl is a retrotransposon, and we have designated the 3960 base insertion in 
pDW42 as Tatl-1 and the 3879 base insertion in pDW99 as Tatl -2. It is apparent that both Tatl- 
1 and Tatl -2 are non- functional. Their ORFs are truncated with respect to the coding 
information found in transposition-competent retrotransposons, and they lack obvious pol motifs. 

Please replace the paragraph beginning at page 53, line 20 with the following amended 
paragraph: 

The Calypso retrovirus-like elements have the same overall structure and sequence 
homology as the previously described Athila and Cyclops elements. The elements are ~12kb in 
length; they have a 5' LTR, a PBS (Primer Binding Site), a gag protein, a pol protein, a spacer, 
an env-like protein, another spacer region, a PPT (Polypurine Tract) and a 3' LTR. The LTRs 
vary from -1 .3 to -1 .5kb in length and characteristically begin with TG and end with CA. The 
PBS is similar to that used by the Athila and Cyclops elements; it is 4 to 6 bases past the 5' LTR 
and matches the 3' end of a soybean aspartic acid tRNA for 18 to 19 bases with 1 mismatch. 
The fact that the sequences of the Calypso primer binding sites are shared with the A. thaliana 
and P. sativum retrovirus-like elements, indicates that this sequence is a unique marker for 
envelope-encoding retroelements. The gag protein extends -850 amino acids and encodes a zinc 
finger domain (characterized by the amino acid motif CxxCxxxHxxxx C SEQ ID NO: 50) and a 
protease domain (characterized by the amino acid motif LIDLGA , SEQ ED NO:51 ). These 
domains are located at approximately the same positions within gag as in other retroelements. 
The -600 amino acid reverse transcriptase region follows gag and has the conserved plant 
retrovirus-like motifs which approximate the following amino acids: KTAF (SEP ID NQ:52) , 
MP/SFGLCNA (SEQ ID NO:53) , V/I/MEVFMDDFSAW/I SEOIDNO:54 , 
FELMCDASDYAI/VGAVLGOR (SEP ID NO:55) . and 
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YATT/IEKEL/MLAIVF/YAL/FEKFR/KS YLI/VGSR/KV (SEP ID NO:56) , respectively. The 
-450 amino acid integrase domain has the plant retrovirus-like integrase motifs that approximate 
HCHxSxxGGH30xCDxCQR (SEP ID NP:57) for the Zn finger as well as two other motifs that 
approximate WGIDFI/V/MGP (SEP ID NO:58). and PYHPQTxGQA/VE (SEP ID NP:59) . 
After integrase, there is a ~0.7kb spacer then a -450 amino acid env-like protein coding region. 
The env-like protein of the Calypso elements is well conserved through most of the PRF but 
conservation decreases toward the C-terminus. The conservation includes 2 or 3 presumed 
transmembrane domains and a putative RNA splice site acceptor. The env-like protein is 
followed by a ~2kb spacer then a polypurine tract with the approximate sequence 
ATTTGGGGG/AANNT (SEP ID NP:60) . The 3' LTR starts immediately after the final T of 
the PPT. 

Please replace the paragraph beginning at page 55, line 31 with the following amended 
paragraph: 

Among the Calypso elements, seven have been characterized that encode env-like PRFs. 
These env-like PRFs form four families that have a high degree of overall sequence similarity 
beginning at the first methionine and continuing for three quarters of the PRF; sequence 
similarity falls off dramatically near the C-terminus. The amino acid sequence at the first 
methionine has the consensus sequence QMASR/KKRR/KA (SEP ID NP:61) , which appears to 
be a nuclear targeting signal, however, the program PSPRT only predicts a 0.300 confidence 
level for this targeting role (Nakai and Horton (1999) Trends Biochem. Sci. 24:34-36). A similar 
sequence (ASKKRK , SEP ID NP:62 ) is found at the same position in the env-like PRF of 
Cyclops2, suggesting that it serves a similar purpose. No other potential targeting peptide stands 
out from the sequence that has been analyzed so far. There is a conserved region that is 
predicted to be a transmembrane domain near the center of the Calypso env-like protein and a 
second transmembrane domain located at variable positions near the C-terminus. These may be 
the fusion and anchor functions of a TM peptide. It should also be noted that five of the seven 
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ORFs are predicted to have a transmembrane domain that is just before and includes the first 
methionine. This N-terminal transmembrane domain may be a secretory signal of an SU peptide. 
The program Tmpred estimates these transmembrane domains to be significant based on a score 
>500 (Hofmann and Stoffel (1993) Biol. Chem. 374:166). These three transmembrane domains 
are found in the Cyclops2 env-like protein at similar locations but at a reduced significance 
score. Another feature of the Calypso env-like ORF is the conserved splice site that is predicted 
to be at the first methionine by the program NetGene2 v. 2.4 with a confidence level of 1 .00 
(Hebsgaard et al. (1996) Nucleic Acids Res. 24:3439-3452); Brunak et al. (1991) J. Mol. Biol 
220:49-65). There are other less preferred putative splices sites in the region, but only the splice 
site near the methionine is optimally placed and conserved in all seven env-like ORFs. 



